Data Summarization in Relational Databases Through FuzzyDependenciesJ
نویسندگان
چکیده
In this paper we deal with the problem of data summarization through the concept of fuzzy dependency. We introduce a projection operator which leads us to partition a database into two projections with a less amount of information. Thus, we can replace the original relation by those projections. In this process we must guarantee that we can recover the original relation, through the projections, using a special join operator. This process can be done whenever the relation satisses a fuzzy dependency. The projection represents a set of fuzzy rules explaining such a dependency. Also, we show that this deenition of dependency maintains the good properties of completeness as in the classical case. 1 Preliminaries 1.1 Introduction The treatment of non crisp information in databases has been accomplished over the last decade by several authors. The study of incomplete information has been addressed in 16, 17, 18, 21], and the study of uncertain in object data models has been accomplished in 25, 29, 33]. Logic and imprecise information have been studied in 15, 22, 30], whereas fuzzy 32] data models have been introduced in 3, 23, 27]. In this work we deal with the problem of treatment of fuzzy dependencies 9, 12, 4, 23, 24, 31] in fuzzy and non fuzzy (crisp) relational databases. We introduce a generalization of the process of normalization, which leads us to compress the original data appearing in a table of a relational database. This allows us to discover dependencies between attributes in a relation r not detected by classical functional dependencies, and furthermore, decompose r into two new relations called, r 1 and r 2 in such a way that: { r 2 stores the information given in the fuzzy dependency, whereas r 1 represents the other information appearing in r. For new entries to the database, we can test the fuzzy dependency just by looking at the tuples in r 2. { The amount of information stored in r 1 and in r 2 is less than in r. { The original data appearing in r can be recovered (in fuzzy terms) through a fuzzy join of r 1 and r 2. The organization of the work is as follows: in this section we overview basic elements of the relational database model and introduce the notion of resemblance relation which will be used to relax the concept of functional dependency; we also give the …
منابع مشابه
Discovering Regularities in Databases Using Canonical Decomposition of Binary Relations
Regularities in databases are directly useful for knowledge discovery and data summarization. As a mathematical background, relational algebra helped for discovering the main data structures and existing dependencies between the different attributes in a relational database. Functional, difunctional and other kinds of dependencies in a relational database describe invariant regular structures t...
متن کاملVisual SQL-X: A Graphical Tool for Producing XML Documents from Relational Databases
The Visual SQL-X system is presented to generate arbitrarily complex XML documents from a graphical query on a relational database. The query describes the structure of the resulting document through a tree-like representation of its structure. The system generates the XML document through the synthesis of a SQL query and a “summarization” algorithm on the resulting table.
متن کاملA survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملFlexible SQLf query based on fuzzy linguistic summaries
Data is often partially known, vague or ambiguous in many real world applications. To deal with such imprecise information, fuzziness is introduced in the classical model. SQLf is one of the practical language to deal with flexible fuzzy querying in Fuzzy DataBases (FDB). However, with a huge amount of fuzzy data, the necessity to work with synthetic views became a challenge for many DB communi...
متن کاملDimensionality Reduction in Data Summarization Approach to Learning Relational Data
Abstrak Due to the growing amount of digital data stored in relational databases, more new approaches are required to learn relational data. The DARA algorithm is designed to summarize data and it is one of the approaches introduced in relational data mining in order to handle data with one-to-many relations. The DARA algorithm transforms data stored in relational databases into a vector space ...
متن کامل